Device, system, and method of capturing the moment in audio discussions and recordings

ABSTRACT

Devices, systems, and methods of Capturing the Moment in audio discussions and recordings. A user utilizes an electronic device to record audio, or to participate in an audio conversation or an audio/video conversation. During the conversation, the user clicks on presses a button to create an audio bookmark at desired time-points. Later, the system generates separate a short audio-clip for a few sentences that were spoken before and after each such audio bookmarks. The system further generates an aggregated clip or summary of the bookmarked segments, as well as textual transcription of the bookmarked content.

FIELD

The present invention is related to the field of audio processing.

BACKGROUND

Millions of people utilize mobile and non-mobile electronic devices,such as smartphones, tablets, laptop computers and desktop computers, inorder to perform various activities. Such activities may include, forexample, browsing the Internet, sending and receiving electronic mail(email) messages, taking photographs and videos, engaging in a videoconference or a chat session, playing games, or the like.

SUMMARY

The present invention may include, for example, systems, devices, andmethods for recording, processing and/or bookmarking various types ofaudio streams or audio segments, such as audio or audio/videodiscussions, conversations, meetings, audio conferences, videoconferences, or the like.

For example, a user utilizes an electronic device to record audio, or toparticipate in an audio conversation or an audio/video conversation.During the conversation, the user may click a designated button to markbookmarks at desired points in time. Later, the system generatesseparately a short audio and/or video and/or content clip that includesa few sentences that were spoken before and after each such audiopoint-in-time of interest. The system generates an aggregated clip orsummary of the bookmarked segments, as well as textual transcription ofthe bookmarked content.

The present invention may provide other and/or additional benefits oradvantages.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic block-diagram illustration of a system, inaccordance with some demonstrative embodiments of the present invention.

FIGS. 2A-2F are flow-charts demonstrating operations of methods inaccordance with some embodiments of the present invention.

DETAILED DESCRIPTION OF SOME DEMONSTRATIVE EMBODIMENTS

The Applicants have realized that a user of an electronic device, suchas a computer or a smartphone, sometimes attends a lengthy audioconversation or audio/video meeting, and may wish to selectively mark orbookmark or tag particular time-points or time-slots in such lengthyconversation that are of greater interest for him, and that he wouldlater desire to retrieve rapidly and efficiently.

Reference is made to FIG. 1, which is a schematic block-diagramillustration of a system 100, in accordance with some demonstrativeembodiments of the present invention. System 100 may comprise two ormore End-User Devices 111-112; for example, a first end-user device 111operated by a first user, and a second end-user device 112 operated by asecond user. Each end-user device may be, for example, a smartphone, atablet, a laptop computer, a desktop computer, a smart-watch, or otherelectronic device or other communication device that enables its user toparticipate in an audio conversation and/or in a video conversationand/or in an audio/video conversation with one or more other users. AConferencing Server 120 operates to establish and enable the connectionbetween the end-user devices 111-112.

Conferencing Server 120 further comprises, or is associated with, aConversation Recording Unit 121, which records the audio (and/or thevideo) of the ongoing conversation. The recorded audio is stored (e.g.,as an audio file; optionally compressed and/or encoded) in a RecordedConversations Repository 122. Furthermore, a Recorded ConversationRetrieval Unit 123 enables a user to selectively retrieve, from theRecorded Conversations Repository 122, a particular previously-recordedaudio recording of a previous audio (and/or video) conference ordiscussion, and to listen to such recording or to user-selected segmentsthereof.

For each recorded audio conversation, the Conferencing Server 120further stores in the Recorded Conversations Repository 122, or in alist or table that points to it, the relevant meta-data for each suchrecorded conversation; for example, date-stamp and time-stamp in whichthe audio recording started; names and/or usernames and/or screen-namesof participants; phone numbers and/or email addresses of participants;device-type or application-type per participant (e.g., User Adamparticipated from his smartphone using a mobile application; User Bobparticipated from his laptop via a browser extension); entry time and/orexit time of participants that joined or left the conversation; or thelike. Accordingly, the Recorded Conversation Retrieval Unit 123 enablesa user to sort or filter or search or browser the Recorded ConversationsRepository 122 based on such meta-data, in order to selectively retrievea particular recorded conversation.

In a first demonstrative example, User Adam is utilizing his End-UserDevice 111 to engage in an audio conference with one or more otherusers; for example, with User Bob who operates his End-User Device 112.During such conversation, User Adam utilizes for this purpose aConferencing Application 113 on his End-User Device 111; such as, amobile “app” or mobile application, or a stand-alone application orsoftware module, or a conferencing application that is implemented as anin-browser application (e.g., using HTML5 and JavaScript and CSS), orthat is implemented as a browser plug-in or a browser extension or abrowser add-on.

Conferencing Application 113 generates and utilizes an on-screenConversation User Interface (UI)/Graphic User Interface (GUI) 114, whichmay include graphical components and/or textual components; for example,a “mute/unmute my microphone” button that enables User Adam to mute andunmute his own audio input that is captured by an acoustic microphone ofEnd-User Device 111; a “leave” or “exit” or “end call” button thatenables User Adam to end the conversation or to leave it; a “sharescreen” button that optionally enables User Adam to share his screenwith other participants; a “camera on/off” or a “video on/off” buttonthat optionally enables User Adam to add a video stream to theconversation, such as a live video stream captured by an imager or acamera of his End-User Device 111; a “chat” button or interface orpanel, which enables User Adam to concurrently engage in a textualexchange of on-screen textual messages with one or more otherparticipants, in parallel to the ongoing audio conversation; and/orother UI or GUI components or elements.

In accordance with the present invention, the Conversation UI/GUI 114that is displayed on the screen of the End-User Device 111 during anongoing conversation, includes a UI/GUI element which may be referred toas a “Capture the Moment” GUI element, or as a “Seize the Moment” GUIelement, or as a “Capture the Moment” GUI Element 115. For example,after 5 minutes into a lengthy conversation, User Adam has just noticedthat User Bob said an important sentence, that is of importance to UserAdam, and that User Adam may wish to later retrieve efficiently andrapidly; and therefore, User Adam clicks (or touches, or taps) on the“Capture the Moment” GUI element 115 on the screen of his End-UserDevice 115. Later, after 3 more minutes into the conversation, User Adamsays an important sentence that he would like to later retrieve oraccess efficiently; therefore, User Adam again clicks (or touches, ortaps) on the “Capture the Moment” GUI element 115 on the screen of hisEnd-User Device 115.

In response to each such click (or tap, or selection) performed by UserAdam on his End-User Device 111, the System 100 creates and stores abookmark or a pointer to the particular time-point at which the “Capturethe Moment” GUI element 115 was engaged by User Adam. For example, aBookmarking Unit 125 may be implemented as a server-side unit (e.g., aspart of Conferencing Server 120) and/or as a client-side unit (e.g., aspart of the Conferencing Application 113 running on the End-User Device111).

Bookmarking Unit 125 creates and/or modifies an Audio Bookmarks Log 126,which is created separately and is updated separately for each recordedaudio conversation and for each participant. For example, for aparticular audio conversation that took place between Adam and Bob, theBookmarking Unit 125 creates (and updates) a first Audio Bookmarks Logwhich logs the time-points in which User Adam has requested to “capturethe moment” or create audio bookmarks; and, separately, the BookmarkingUnit 125 creates (and updates) a second Audio Bookmarks Log which logsthe time-points in which User Bob has requested to “capture the moment”or create audio bookmarks. Each such Audio Bookmarks Log 126 storespointers or indicators of the relevant time-points in which the relevantuser has requested to “capture the moment”; it is thus a per-user,per-conversation, Audio Bookmarks Log.

In the above example, the Audio Bookmarks Log 126 of User Adam mayinclude two audio bookmarks: a first audio bookmark which points to 5minutes into the audio conversation; and a second bookmarks which pointsto 8 minutes into the audio conversation. User Bob may create, inparallel, his own audio bookmarks for that same conversation; such as,User Bob may request to “capture the moment” at other time-points duringthat conversation, and a separate and different Audio Bookmarks Log iscreated and updated for User Bob with regard to that audio conversation.

In some embodiments, each per-user Audio Bookmarks Log is stored onlylocally at the End-User Device of the relevant user who created thoseaudio bookmarks. In other embodiments, each per-user Audio Bookmarks Logis stored locally at the End-User Device of the relevant user whocreated those audio bookmarks, and also at the Recorded ConversationsRepository 122, such as, as additional meta-data file or meta-data listthat accompanies the recorded audio conversation, and is only availableaccessible by the user who created those audio bookmarks. In someembodiments, optionally, a user may selectively share his AudioBookmarks Log, or parts of it, with other users or participants.

Once the audio conversation has ended, User Adam may request from hisEnd-User Device to provide him with a collection or a summary or anaggregation of all the “moments” that he selectively “captured” duringthe audio conversation. For example, User Adam may push or click or tapor select a “Generate Collection of Captured Moments” GUI Element 116,which in case sends a “generate collection of captured moments” commandor signal to the Conferencing Server 120 or to other component of System100, or otherwise triggers or initiates such collection.

Upon such signal or command, a Moments Collector Unit 127 operates togenerate a Collection of Captured Moments based on the particular AudioBookmarks Log 126 of that requesting user for that recorded audioconversation. For example, the Moments Collector Unit 127 obtains orretrieves the recorded audio file of that entire conversation; and thenextracts from it particular audio-segments that preceded and followedeach and every one of the time-points that were bookmarked by thatspecific user.

For example, the Moments Collector Unit 127 may extract, for eachtime-point, an audio segments that begins T1 seconds before the bookmarktime-point (e.g., 10 or 15 seconds before it), and that ends T2 secondsafter the bookmark time-point (e.g. 5 or 8 seconds after it); therebygenerating an audio segment of approximately 15 or 20 seconds of theconversation that “surrounded” each such audio bookmark time-point. Insome embodiments, the values of T1 and/or T2 may be hard-coded, or maybe user-configurable or user-modifiable; for example, User Adam mayconfigure that he wants his “moments” audio segments to include 30seconds before and after each audio bookmark of his; whereas, User Bobmay configure that he wants his “moments” audio segments to include 18seconds before and 7 seconds after each audio bookmark of his.

In other embodiments, the Moments Collector Unit 127 may operate inassociation with a Speaker Recognition Unit 128, which identifies orrecognizes which participant has spoken at each particular time ortime-period or time-slot during the conversation. For example, theSpeaker Recognition Unit 128 may keep track of each speaker in real-timeas he utters speech during the conversation itself (e.g., while theaudio is also being recorded), based on the source of the audio streamof each participant; such as, by logging that User Bob has spoken from14:30 to 14:35 (in format MM:SS) since it was the audio stream that wasincoming from his Smartphone that generated the audio that can be heardduring that time-slot. In other embodiments, optionally, the SpeakerRecognition Unit 128 may utilize other methods of speakeridentification, in order to correlate between a particular audiotime-slot and its respective speaker; for example, based on an audiosignature or an audio profile that is pre-constructed for eachparticipant, or based on gender recognition or gender identification(e.g., identifying that the speaker at 12:48 is a female speaker, andcorrelating it with User Janet who is the only female participant inthis conversation, based on user names or participant names and theircorrelation with genders); or based on other Speaker Identificationalgorithms or Speaker Recognition algorithms.

Accordingly, the Moments Collector Unit 127 may perform the followingoperations, for each of the Audio Bookmarks of a particular AudioBookmarks Log 126: (i) obtain from the Speaker Recognition Unit 128 anindication of the identity of the particular participant, who wastalking at the time-point of the audio bookmark; (ii) obtain from theSpeaker Recognition Unit 128, the exact time-point T1 in which thatparticular participant has started speaking, and the exact time-point T2in which he stopped speaking; (iii) extract from the recorded audioconversation, an audio segment that is from time-point T1 to time-pointT2; thereby including the entire utterance that this particularparticipant has said, and during which User Adam has commanded to“capture the moment”. In some embodiments, the above operations may beperformed by a sub-unit, such as a Per-Participant Utterance ExtractorUnit 129, which may be part of the Moments Collector Unit 127 or may beoperably associated therewith. In some embodiments, optionally, thePer-Participant Utterance Extractor Unit 129 may be configured by thesystem, or by each user, to have a time-limit (e.g., up to 60 seconds)for each extracted audio segment; in order to avoid a situation inwhich, for example, User Bob has spoken without interruption for 12minutes, and User Adam captured the moment in the middle of that speech;and in such situation, only a particular time-slot (e.g., 60 secondslength) that surrounds that time-point would be extracted, rather thanextracting the entire 12 minutes.

In some embodiments, the Per-Participant Utterance Extractor Unit 129,and/or other suitable unit(s) or component(s) of the system, mayoptionally operate to extend or to increase or to contract or to shortenor to decrease or to otherwise modify or adjust, the time-length of atime-slot that surrounds an audio bookmark, from which audio and/orvideo and/or text are extracted or isolated; and/or to move or adjust orto postpone or to delay a commencing time-point that precedes the markedaudio bookmark; and/or to move or adjust or to postpone or to delay anending time-point that precedes the marked audio bookmark; for example,performing such modifications or adjustments of time-slot boundariesautomatically (e.g., based on analysis of the audio and determination ofsilence periods or silence gaps, and/or based on detecting a change inthe current speaker), and/or performing such modifications based on userinput. For example, User Adam is interested in shorter time-slots ofextracted content; and therefore, he may configure the system that foreach of his audio bookmarks, the time-slot of extracted audio shouldcommence 5 seconds before the audio bookmark and should end 5 secondsafter the audio bookmark. In contrast, User Bob is interested in longertime-slots of extracted content; and therefore, he may configure thesystem that for each of his audio bookmarks, the time-slot of extractedaudio should commence 14 seconds before the audio bookmark and shouldend 12 seconds after the audio bookmark. The system may thus enable eachuser to configure the length of time-slots, that surround each audiobookmark of the user, from which text and/or audio and/or video areextracted for the purpose of summarizing the conversation and/orgenerating discrete clips. Additionally or alternatively, suchadjustment or modification of time-slot boundaries may be performedautomatically or autonomously by the system, based on detection of theactive speaker; for example, User Adam has marked an audio bookmark attime-point T1; the system detects that at this time point, the activespeaker was Participant David; the system also detects automaticallythat Participant David has spoken in the 4 seconds that precededtime-point T1, and in the 3 seconds that followed time-point T1;therefore, the system may automatically and/or autonomously adjust ormodify the length of the time-slot that is allocated to this audiobookmark of User Adam, to include those 7 seconds (four seconds beforethe bookmark and three seconds after the bookmark); even though ingeneral the system may be configured to extract time-slots of aparticular length (e.g., 15 seconds before and 15 seconds after eachaudio bookmark), and/or even though User Adam had configured aparticular set of time margins to generally apply to his own audiobookmarks (e.g., generally using 12 seconds before and 8 seconds after,for his audio bookmarks). Adjustments or modifications of thetime-slots, that surround each audio bookmark, may be based on othercriteria or conditions; and such time-slots may have differentialtime-length, such that a first audio bookmark of User Adam will beassociated with a first time-length (N1 seconds) of content thatsurrounds it, whereas a second audio bookmark of User Adam for the sameaudio conversation will be associated with a second, different,time-length (N2 seconds) of content that surrounds it.

The Moments Collector Unit 127 may thus generate and collect discreteaudio segments, which correlate to before-and-after utterances relativeto each bookmarked time-point. In some embodiments, the MomentsCollector Unit 127 may trigger an Aggregated Audio Clip Generator 130 toautomatically generate a single aggregated audio clip, that includes allthose bookmarked audio portions (e.g., concatenated one after the other,in sequence); thereby allowing User Adam to hear, for example, afive-minute audio segment that contains several highlights of the audioconversation that he had bookmarked, out of a lengthy 60-minuteconversation that he attended.

In other embodiments, instead of (or in addition to) generating a singleaggregated audio clip, the Moments Collector Unit 127 may trigger aDiscrete Audio-Clips Generator 131, to generate a set of discrete orseparate audio-clips or audio-files, each one of them corresponding toone of the bookmarked time-points. The list of such discrete audio clipsmay be shown to the requesting user (who created those audio bookmarks);and the system 100 may enable him to selectively download one, or some,or all of those short audio-clips or audio-segments, as separate audiofiles (e.g., as MP3 files), which he can then save or store on hisdevice, and/or that he may selectively share with other people or send(e.g., as an attached file) to other people, or that he may post onsocial networks or social media websites, or may otherwise utilize.

In some embodiments, System 100 may further comprise a TranscriptGenerator 132, which utilizes a speech-to-text engine or an AutomaticSpeech Recognition (ASR) unit to generate a transcript of the entireaudio conversation, or to selectively transcribe only the audio-segmentsthat surround each of the bookmarked time-points. For example,Transcript Generator 132 may generate a transcript of the entire audioconversation; and the Moments Collector Unit 127 may then obtain fromsuch transcript of the transcript-portion that correspond to eachaudio-segment that surrounds each bookmarked time-point, e.g., thetranscript of the conversation that started T1 seconds before thebookmarked time-point and that ended T2 seconds after the bookmarkedtime-point; or, the transcript portion that corresponds to the utterancethat was said, just prior to that bookmarked time-point, by theparticipant that is identified by the Speaker Recognition Unit 128 asthe person that spoke during that time-point. The Moments Collector Unit127 may aggregate all those transcript-portions, that correspond topre-defined time-periods that surround each audio bookmark, and mayutilize a Transcript-Portions Collector Unit 133 to generate anaggregated textual output that includes those textual portions of thetranscript; and such aggregated textual item may be presented on thescreen of the requesting user (who created that audio bookmarks log), ormay be available for copying or for saving or downloading or sharing orposting.

In some embodiments, the user who creates the audio bookmarks, mayfurther utilize his end-user device in order to name or tag them, usingan Audio Bookmarks Renaming/Tagging Unit 134; for example, via aninterface that allows him to add a meta-data item, or one or more titlesor keywords or textual descriptors, to each such audio bookmark that hecreated, and/or to the discrete audio clips that are later presented tohim for his further usage. Accordingly, for example, User Adam may nameor tag his first audio bookmark with a title or a tag or a descriptor of“Bob Approved the Budget”; and he may then name or tag his second audiobookmark with a title or a tag or a descriptor of “Carl Objected to theProposal”. Such tags or titles or descriptors may be saved or stored, asmeta-data or as tags or keywords or titles, in relation to each suchrecorded audio conversation and in relation to each user that createshis own audio bookmarks log. Furthermore, the system may further enableUser Adam to browser or sort or filter or search the repository of audiorecordings, based on such tags or titles or descriptors that he created;for example, User Adam may query the system for all the audio recordingsin which he has created an Audio Bookmark that he named (or tagged) with“Budget”; and the system may find and fetch the audio recordings thatmeet such search criteria, and may further present to User Adam theparticular audio-segments that correspond to the particular searchedterm or keyword or tag. For example, a Cross-Conversations Search Unit135 may search multiple recorded audio conversations, and may find allthose conversations in which User Adam has created an audio bookmarkthat he tagged with the keyword “Budget”; and may collect from thosemultiple audio conversations, only the audio-segments that correspond tothe “Budget” related bookmarks; thereby providing to User Adam anefficient and concise list of those audio segments that pertain to“Budget” across multiple different audio recordings of differentconversations. In some embodiments, the Cross-Conversations Search Unit135 may further generate and provide to User Adam a cumulative list ofthe transcript-portions, that correspond to those audio-portions that hehad bookmarked across multiple audio conversations, for his furtherutilization, saving, storing, copying, downloading, or posting.

It is noted that each participant in the audio conversation may performhis “capture the moment” audio bookmarking operations in a confidentialmanner or in a secretive manner, by rapidly and efficiently clickingonce (per audio bookmark) on the “Capture the Moment” GUI element on thescreen of his own end-user device; without having to speak or to make avoice command or a voice indication that this participant has justcreated this audio bookmark for his own private usage; and without otherparticipants being aware that he created such audio bookmark; andwithout such audio bookmarks, their existence, and their timing, beingavailable or accessible to other users or to other participants (unlessthe creating user has explicitly shared them or allowed access to them).

In another demonstrative set of embodiments, audio bookmarks for“capturing the moment” may be added or created with regard to an audioconversation, not only in real time during an ongoing audioconversation, or not necessarily in real time during an ongoing audioconversation. Rather, audio bookmarks for “capturing the moment” may beadded or created in retrospect with regard to an already-finishedalready-recorded audio conversation, by a user that listens inretrospect to the audio recording of such conversation. For example,User Carla may user her end-user device to listen to a recorded audioconversation that took place between participants Adam and Bob; andduring such playback, User Carla may utilize the “Capture the Moment”GUI element on her end-user device to create her own audio bookmarks forsuch audio recording; thereby generating for User Carla, who did notparticipate in that audio conversation, her own, private, AudioBookmarks Log 126, which may be stored on her own end-user device and/ormay be stored in the Recorded Conversations Repository 122 in relationto that particular audio conversation (e.g., in a manner that is notaccessible to Adam or Bob, or to other users, without the specificapproval of User Carla who created such private audio bookmarks). Forexample, some or all of the components and/or the functionalities thatare described above, may be incorporated into a Playback Application136, without requiring the bookmarking user to utilize an audioconferencing application.

In some embodiments, the audio bookmarking “capture the moment”functionalities may be incorporated into other types of applications ordevices; for example, they may be part of a device or an application fordictating a lecture or for taping or recording a lecture or a speech, orthey may be a “dicta-phone” application (or a “dicta-phone” device; or adevice or application that records an audio conversation) that recordsaudio spoken by one person or by several persons, or may be part of anaudio recording device or application, or may be part of an audiotranscribing device or application, or may be part of an Audio RecordingApplication 137 of other suitable type or purpose. For example, a usermay record the audio of a face-to-face conversation between himself andone or more other persons (e.g., business meeting, personal meeting, jobinterview, personal interview, police investigation or law enforcementinvestigation, police interrogation or law enforcement interrogation,recording of a lecture or a speech given by a speaker or a professor ora teacher or a political speaker, or the like), and that person (e.g.,which takes part actively in such audio conversation as an activeparticipant and speaker, and/or which takes part passively in such audioconversation as a passive non-participating listener or as an audiencemember) may push or press or click on the “capture the moment” button orGUI element in order to create audio bookmarks at particular time-pointsduring the ongoing audio conversation or discussion; for subsequentreview and collection of those particular audio-segments.

In some embodiments, the Bookmarking Unit 126, and/or other componentsof the system, may optionally provide to a participant the ability toindicate, in real time while an audio conversation is ongoing, that anextension of a prolonging or a forward-stretching of a time-slot of anaudio bookmark should be performed. For example, the Bookmarking Unit126 may generated a first GUI element for creating a new audio bookmark;and a second GUI element for indicating a user request or a usersuggestion for time extension of the time-slot that is associated withthe most recently-indicated audio bookmark. User Adam is a participantin a conversation; at time-point 2:30 (two and a half minutes) User Adamclicks or presses or touches the “Create Audio Bookmark” button, sinceUser Adam has just heard User Bob say a sentence of interest. Regularly,the system is configured, in general for all users, or (in someembodiments) for the audio bookmarks of User Adam, to associate eachaudio bookmark with a time-slot of length L (e.g., 30 seconds), whichstarts L1 seconds (e.g., 20 seconds) before the audio bookmark commandwas inputted, and which ends L2 seconds (e.g., 10 seconds) after theaudio bookmark command was inputted. However, as User Bob continues totalk, the participant User Adam notices that the original utterance orsentence is continuing to be of interest, and that the time-slot that isassociated with this particular bookmark should be extended orprolonged; for example, to include 60 seconds of time-length, instead ofthe regular 30 seconds. Accordingly, User Adam presses or clicks ortouches or otherwise selects the GUI element of “Time Extension forMost-Recent Audio Bookmark”, and this action indicates to the systemthat the length L of the time-slot of the most-recent audio bookmarkshould be extended; for example, by adding a pre-defined extensionperiod (e.g., adding 20 seconds to the time slot, upon every additionalpress of that GUI element of “Time Extension for Most-Recent AudioBookmark”), or by doubling the original time-length L (or by multiplyingit by a pre-defined factor that is larger than one, such as 1.5 or1.75); or by extending that time-slot to begin L1 seconds before thetime-point of the audio bookmark, and to end L3 seconds (e.g., 12seconds) after the pressing of the GUI element of “Time Extension forMost-Recent Audio Bookmark”. This mechanism may enable a participant tosignal to the system, that a particular audio bookmark should beassociated with a time-slot that is greater or longer than the “defaultlength” that is defined for audio bookmarks of that particular user, orthat is defined for audio bookmarks of users in general.

In some embodiments, each participant may configure or modify thetime-lengths of L and/or L1 and/or L2 and/or L3 that are associated withhis own audio bookmarks. For example, the system may provide defaultvalues, such as, L=30 seconds, and L1=20 seconds, and L2=10 seconds, andL3=12 seconds; and those values apply to all users unless modified. UserAdam may configure the system such that for his audio bookmarks, thoseparameters will have other values; in order to enable the system tocapture for him longer or shorter time-slots of audio that areassociated with each of his audio bookmarks.

In some embodiments, each audio-clip or audio-segment that the systemgenerates (e.g., as a discrete audio portion) for each audio bookmark,and/or each transcript-portion that is associated with suchaudio-segment or which each audio bookmark, may be associated with oneor more Attributes or Descriptors that may be set, stored, modified, andlater searched and browsed by the user who created the audiobookmark(s). For example, each audio bookmark may be associated with aName or Title attribute that may be provided or edited by the creator ofthe audio bookmark (e.g., a textual string of “Bob approved thebudget”); with one or more Tags or Keywords (e.g., tag “Budget”, tag“Approval”); and with identifiers of one or more speakers that havespoken during the time-slot associated with that audio bookmark, or whospoke exactly during the time-point in which the bookmarking user haspressed or clicked or touched the GUI element that provides the audiobookmark creation command, and optionally, the Speaker Recognition Unit128 may autonomously and/or automatically recognize or detect oridentify the relevant speaker(s) and may add their identity as a textualannotation that is associated with that audio bookmark as an attributethereof. For example, in an audio conversation of users Adam, Bob andCarla, the user Adam creates an audio bookmark exactly when User Bob isspeaking; the system recognizes automatically that User Bob was thespeaker at that time-point in which User Adam commanded to create hisaudio bookmark; and therefore, the system automatically adds anAttribute or meta-data or descriptor, to that particular audio bookmark,to indicate that the Speaker in that moment was User Bob. TheBookmarking Unit 125 may perform these operation, of creating,modifying, managing, searching and/or browsing audio bookmarks based on(or in association with) Attributes; optionally utilizing an AudioBookmark Attributes Management Unit, which may be part of theBookmarking Unit 125 or may be associated with it. For example, userAdam may later search or browse his audio bookmark, within a particularaudio conversation, or even across several or all of the audioconversations that are accessible to him, by requesting all the audiobookmarks in which the Speaker was User Bob; and the system may provideto User Adam those particular audio bookmarks, and their discrete audioclips and transcript-portion.

In some embodiments, the system may further generate, automatically, a“trailer” or an “audio trailer” or a summary of the audio bookmarks thata particular user had created for a particular audio conversation. Forexample, User Adam has participated in an audio conversation with UserBob and User Carla, which lasted 90 minutes. During that audioconversation, User Adam has created Six audio bookmarks. User Adam maythen utilize a suitable GUI element to command the system to “generatean audio trailer” or “generate a summary of my audio bookmarks”. Inresponse to such command, the system generates Six audio-segments thatcorrespond to those six audio bookmarks that User Adam created, eachaudio-segment having a time-length L (for example, 30 seconds), eachaudio-segment commencing L1 seconds before the time-point of the audiobookmark (e.g., 18 seconds before it), and ending L2 seconds after thetime-point of the audio bookmark (e.g., 12 seconds after it); therebygenerating a set of Six audio-clips, each clip being 30 seconds long;and then automatically merging or joining them or concatenating them tocreate an aggregated 3-minute (180 seconds) audio clip which is an“audio trailer” of that 90-minute audio conversation, reflecting theaudio bookmarks that were of interest to User Adam. The generated “audiotrailer” may be downloaded or saved by User Adam as an audio file, ormay be shared or sent or posted; and may be accompanied by a textualtranscript that corresponds to the speech portions that were said withinthose time-slots that are part of the “audio trailer”, to thus provide atranscript version or a textual representation of that “audio trailer”.It is noted that the system may thus generate, for the same audioconversation: for User Adam, a first “audio trailer” which includes 300seconds of audio, corresponding to 6 audio bookmarks that User Adamcreated; and for User Carla, a second, different, “audio trailer” whichincludes 200 seconds of audio, corresponding to 4 audio bookmarks thatUser Carla created (for the same 90-minute audio conversation); therebyenabling each user or each participant to obtain a summary or an audiotrailer that suits his particular portions-of-interest.

In some embodiments, each user or participant may configure the systemto generate automatically an “audio trailer” that includes only some,but not all, of his own audio bookmarks; for example, based on anAttribute that he selects, or based on an identified speaker, or a tag,or other attribute. For example, even though User Adam has created Sixaudio bookmarks for the 90-minutes conversation, User Adam may requestthe system to generate an audio trailer that corresponds only to theaudio bookmarks that have the word “Budget” in their Title (or, as a Tagor as an Annotation); and in response to such request or command, thesystem selects only some of the six audio bookmarks for which thiscriterion holds true, and generates the audio trailer only for thoseparticular audio bookmarks. In another example, even though User Adamhas created Six audio bookmarks for the 90-minutes conversation, UserAdam may request the system to generate an audio trailer thatcorresponds only to the audio bookmarks in which User Carla was theautomatically-detected Speaker; and in response to such request orcommand, the system selects only some of the six audio bookmarks forwhich this criterion holds true, and generates the audio trailer onlyfor those particular audio bookmarks.

The system may therefore enable each user to create his own audiobookmark(s), for the same audio conversation, reflecting highlights orpoints-of-interests of content-of-interest for that particular user.When several such audio bookmarks or highlights are created (or evenwhen only one such audio bookmark or highlight is created), the systemmay generate audio clips that correspond to time-slots that include thetime-points of those audio bookmarks (for example, each time slotincluding T1 seconds before the time-point and including T2 secondsafter the time-point); and an aggregation or a sequence of those audioclips may be generated, stored, saved, shared, and/or played as audio;thereby generating and playing a virtual summary or a user-specificaudio summary of a meeting based on the points-of-interest that thespecific user had marked; and optionally including the audio captured inthose time-slots (user-specific audio summary) and/or the text of wordsor speech that were spoken in those time-slots (user-specific textualsummary); thus creating a user-specific or personal or private summaryof the audio conversation, which includes only the portions that wereportions-of-interest to this specific user. The method may thuscomprise, for example: generating from said audio conversation, auser-specific audio summary which corresponds to audio captured attime-slots that correspond to the audio bookmarks created by said user,and further generating a user-specific textual summary which correspondsto text of the audio that was captured in said time-slots; wherein adifferent user-specific audio summary, and a different user-specifictextual summary, are generated and then played or displayed separatelyto each user based on the audio bookmarks that said user created.

Some embodiments may include a method (or a computerized method, or acomputerized process) comprising: (a) during an ongoing audioconversation, receiving from an electronic device of a participant, atactile input via a touch-screen of the electronic device or via aphysical input unit of the electronic device, of an audio bookmarkingcommand that indicates a request of said participant to create an audiobookmark, wherein step (a) is performed in real time while recordingsaid audio conversation; (b) determining a time-point within said audioconversation, at which said user-input was received; (c) automaticallygenerating for said participant a user-specific set of audio bookmarksfor a recorded version of said audio conversation. Each audio bookmarkin the set of audio bookmarks, points at a particular time-point withinthe recorded version of said audio conversation in which saidparticipant provided an audio bookmarking command.

In some embodiments, the method comprises: receiving from saidparticipant, at a first time-point T1 within said ongoing audioconversation, a first audio bookmarking command; receiving from saidparticipant, at a second time-point T2 within said ongoing audioconversation, a second audio bookmarking command; automaticallygenerating a set of audio bookmarks, which comprises: (i) a first audiobookmark that points to the first time-point T1 in the recorded versionof said audio conversation, and (ii) a second audio bookmark that pointsto the second time-point T2 in the recorded version of said audioconversation.

In some embodiments, the method comprises: generating, for a firstparticipant in said audio conversation, a first set of audio bookmarksthat correspond to audio time-points that said first participantcommanded to bookmark; generating separately, for a second participantin said audio conversation, a second and separate set of audio bookmarksthat correspond to audio time-points that said second participantcommanded to bookmark.

In some embodiments, the method comprises: wherein each of the first setand the second set is a private set of audio bookmarks; wherein themethod comprises: authorizing only the first participant to access thefirst set of audio bookmarks; authorizing only the second participant toaccess the second set of audio bookmarks.

In some embodiments, the method comprises: (d) for a set of audiobookmarks having N audio bookmarks of said participant, automaticallygenerating a set of N separate audio-clips; wherein each generatedaudio-clip includes a time-constrained portion of the recorded versionof said audio conversation that includes audio that was capturedsurrounding a time-point of a single audio bookmark.

In some embodiments, the method comprises: (d) for a set of audiobookmarks having N audio bookmarks of said participant, automaticallygenerating a set of N separate audio-clips; wherein each generatedaudio-clip includes a time-constrained portion of the recorded versionof said audio conversation that includes audio that was capturedsurrounding a time-point of a single audio bookmark, wherein saidtime-constrained portion of the recorded audio version includes recordedaudio that was recorded P1 seconds immediately before the time-point andP2 seconds immediately after the time-point, wherein P1 and P2 arepre-defined values.

In some embodiments, the method comprises: (e) based on a textual inputreceived from said participant, assigning a particular name or aparticular tag to a particular audio-clip of said N separateautomatically-generated audio-clips.

In some embodiments, the method comprises: performing a search foraudio-clips that are associated with a particular name or tag.

In some embodiments, the method comprises: for each audio bookmarkcreated by said participant, applying a Speaker Identification processthat identifies a speaker at a time-point that was audio bookmarked; andextracting from the recorded version of the conversation, an audiosegment that surrounds said time-point and that includes aspeech-segment made by said speaker around said time-point.

In some embodiments, the method comprises: for each audio bookmarkcreated by said participant, applying a Speaker Identification processthat identifies a speaker at a time-point that was audio bookmarked; andextracting from the recorded version of the conversation, an audiosegment that surrounds said time-point and that includes only aspeech-segment made by said speaker around said time-point and thatexcludes speech by any other participants.

In some embodiments, the method comprises: automatically generating atextual transcript of said audio conversation, by utilizing aspeech-to-text transcription process; for each audio bookmark created bysaid participant, extracting from said textual transcript, a transcriptportion that corresponds to a pre-defined time-slot that surrounds thetime-point of the audio bookmark; and providing said transcript portionfor copying or sharing.

In some embodiments, the method comprises: automatically generating atextual transcript of said audio conversation, by utilizing aspeech-to-text transcription process; for each audio bookmark created bysaid participant, extracting from said textual transcript, a transcriptportion that corresponds to a pre-defined time-slot that surrounds thetime-point of the audio bookmark;

generating a collection of all the transcript portions, that correspondto speech segments that were said during a time-slot that surrounds thetime-point of each audio bookmark.

In some embodiments, the method comprises: associating between (I) eachaudio bookmark created by said participant, and (II) one or moreattributes that are set by said participant; wherein said one or moreattributes comprise at least one of: a title, a name, an annotation, atextual descriptor, a speaker name, a topic, a tag; enabling saidparticipant to search or to browse audio bookmarks, based on one or moreattributes.

In some embodiments, the method comprises: associating between (I) eachaudio bookmark created by said participant, and (II) a descriptor of aspeaker who spoke during the time-slot of said audio bookmark and thatwas recognized automatically by a speaker recognition unit; enablingsaid participant to search or to browse audio bookmarks based on aspeaker descriptor.

In some embodiments, the method comprises: automatically generating forsaid participant an audio trailer of said audio conversation, byaggregating audio-segments of the audio-conversation, each audio-segmentcorresponding to a time-slot of the audio conversation that surroundsthe time-point of the audio bookmark; wherein the audio trailer isprovided to said participant as at least one of: (i) an audio clip fordownloading or sharing, (ii) a textual transcript.

In some embodiments, the method comprises: automatically generating forsaid participant, a selective audio trailer of said audio conversationthat is based on a particular speaker, by aggregating audio-segments ofthe audio-conversation in which a speaker recognition unit hadrecognized said particular speaker; each audio-segment corresponding toa time-slot of the audio conversation that surrounds the time-point ofthe audio bookmark; wherein the selective audio trailer is provided tosaid participant as at least one of: (i) an audio clip for downloadingor sharing, (ii) a textual transcript.

In some embodiments, the method comprises: receiving from saidparticipant, while said audio conversation is ongoing, an extensioncommand to extend a time-slot that will be associated with arecently-commanded audio bookmark; in response to said extensioncommand, extending a time-length of the time-slot that is associatedwith the recently-commanded audio bookmark.

In some embodiments, the method comprises: generating from said audioconversation, a user-specific audio summary which corresponds to audiocaptured at time-slots that correspond to the audio bookmarks created bysaid user, and further generating a user-specific textual summary whichcorresponds to text of the audio that was captured in said time-slots;wherein a different user-specific audio summary, and a differentuser-specific textual summary, are generated and then played ordisplayed separately to each user based on the audio bookmarks that saiduser created.

Some embodiments may include a non-transitory storage medium havingstored thereon instructions that, when executed by one or moreprocessors, cause the one or more processors to perform a method asdescribed above.

Some embodiments may include a system comprising: one or moreprocessors, operably associated with one or more memory units; whereinthe one or more processors are configured: (a) during an ongoing audioconversation, to receive from an electronic device of a participant, atactile input via a touch-screen of the electronic device or via aphysical input unit of the electronic device, of an audio bookmarkingcommand that indicates a request of said participant to create an audiobookmark, wherein (a) is performed in real time while recording saidaudio conversation; (b) to determine a time-point within said audioconversation, at which said user-input was received; (c) toautomatically generate for said participant a user-specific set of audiobookmarks for a recorded version of said audio conversation; whereineach audio bookmark in the set of audio bookmarks, points at aparticular time-point within the recorded version of said audioconversation in which said participant provided an audio bookmarkingcommand.

In accordance with the present invention, each one of the devices orunits of system 100 may be implemented by using (or may comprise) one ormore hardware units and/or software units, processors, CPUs, DSPs,integrated circuits, memory units, storage units, wireless communicationmodems or transmitters or receivers or transceivers, cellulartransceivers, a power source, input units, output units, OperatingSystem (OS), drivers, applications, and/or other suitable components.

Some embodiments comprise a non-transitory storage medium or storagearticle having stored thereon instructions that, when executed by ahardware processor, cause the hardware processor to perform a method asdescribed above and/or herein.

Although portions of the discussion herein relate, for demonstrativepurposes, to wired links and/or wired communications, some embodimentsof the present invention are not limited in this regard, and may includeone or more wired or wireless links, may utilize one or more componentsof wireless communication, may utilize one or more methods or protocolsof wireless communication, or the like. Some embodiments may utilizewired communication and/or wireless communication.

The present invention may be implemented by using hardware units,software units, processors, CPUs, DSPs, integrated circuits, memoryunits, storage units, wireless communication modems or transmitters orreceivers or transceivers, cellular transceivers, a power source, inputunits, output units, Operating System (OS), drivers, applications,and/or other suitable components.

The present invention may be implemented by using a special-purposemachine or a specific-purpose that is not a generic computer, or byusing a non-generic computer or a non-general computer or machine. Suchsystem or device may utilize or may comprise one or more units ormodules that are not part of a “generic computer” and that are not partof a “general purpose computer”, for example, cellular transceivers,cellular transmitter, cellular receiver, GPS unit, location-determiningunit, accelerometer(s), gyroscope(s), device-orientation detectors orsensors, device-positioning detectors or sensors, or the like.

The present invention may be implemented by using code or program codeor machine-readable instructions or machine-readable code, which isstored on a non-transitory storage medium or non-transitory storagearticle (e.g., a CD-ROM, a DVD-ROM, a physical memory unit, a physicalstorage unit), such that the program or code or instructions, whenexecuted by a processor or a machine or a computer, cause such device toperform a method in accordance with the present invention.

Embodiments of the present invention may be utilized with a variety ofdevices or systems having a touch-screen or a touch-sensitive surface;for example, a smartphone, a cellular phone, a mobile phone, asmart-watch, a tablet, a handheld device, a portable electronic device,a portable gaming device, a portable audio/video player, an AugmentedReality (AR) device or headset or gear, a Virtual Reality (VR) device orheadset or gear, a “kiosk” type device, a vending machine, an AutomaticTeller Machine (ATM), a laptop computer, a desktop computer, a vehicularcomputer, a vehicular dashboard, a vehicular touch-screen, or the like.

The system(s) and/or device(s) of the present invention may optionallycomprise, or may be implemented by utilizing suitable hardwarecomponents and/or software components; for example, processors,processor cores, Central Processing Units (CPUs), Digital SignalProcessors (DSPs), circuits, Integrated Circuits (ICs), controllers,memory units, registers, accumulators, storage units, input units (e.g.,touch-screen, keyboard, keypad, stylus, mouse, touchpad, joystick,trackball, microphones), output units (e.g., screen, touch-screen,monitor, display unit, audio speakers), acoustic microphone(s) and/orsensor(s), optical microphone(s) and/or sensor(s), laser or laser-basedmicrophone(s) and/or sensor(s), wired or wireless modems or transceiversor transmitters or receivers, GPS receiver or GPS element or otherlocation-based or location-determining unit or system, network elements(e.g., routers, switches, hubs, antennas), and/or other suitablecomponents and/or modules.

The system(s) and/or devices of the present invention may optionally beimplemented by utilizing co-located components, remote components ormodules, “cloud computing” servers or devices or storage, client/serverarchitecture, peer-to-peer architecture, distributed architecture,and/or other suitable architectures or system topologies or networktopologies.

In accordance with embodiments of the present invention, calculations,operations and/or determinations may be performed locally within asingle device, or may be performed by or across multiple devices, or maybe performed partially locally and partially remotely (e.g., at a remoteserver) by optionally utilizing a communication channel to exchange rawdata and/or processed data and/or processing results.

Some embodiments may be implemented by using a special-purpose machineor a specific-purpose device that is not a generic computer, or by usinga non-generic computer or a non-general computer or machine. Such systemor device may utilize or may comprise one or more components or units ormodules that are not part of a “generic computer” and that are not partof a “general purpose computer”, for example, cellular transceivers,cellular transmitter, cellular receiver, GPS unit, location-determiningunit, accelerometer(s), gyroscope(s), device-orientation detectors orsensors, device-positioning detectors or sensors, or the like.

Some embodiments may be implemented as, or by utilizing, an automatedmethod or automated process, or a machine-implemented method or process,or as a semi-automated or partially-automated method or process, or as aset of steps or operations which may be executed or performed by acomputer or machine or system or other device.

Some embodiments may be implemented by using code or program code ormachine-readable instructions or machine-readable code, which may bestored on a non-transitory storage medium or non-transitory storagearticle (e.g., a CD-ROM, a DVD-ROM, a physical memory unit, a physicalstorage unit), such that the program or code or instructions, whenexecuted by a processor or a machine or a computer, cause such processoror machine or computer to perform a method or process as describedherein. Such code or instructions may be or may comprise, for example,one or more of: software, a software module, an application, a program,a subroutine, instructions, an instruction set, computing code, words,values, symbols, strings, variables, source code, compiled code,interpreted code, executable code, static code, dynamic code; including(but not limited to) code or instructions in high-level programminglanguage, low-level programming language, object-oriented programminglanguage, visual programming language, compiled programming language,interpreted programming language, C, C++, C #, Java, JavaScript, SQL,Ruby on Rails, Go, Cobol, Fortran, ActionScript, AJAX, XML, JSON, Lisp,Eiffel, Verilog, Hardware Description Language (HDL), BASIC, VisualBASIC, Matlab, Pascal, HTML, HTML5, CSS, Perl, Python, PHP, machinelanguage, machine code, assembly language, or the like.

Discussions herein utilizing terms such as, for example, “processing”,“computing”, “calculating”, “determining”, “establishing”, “analyzing”,“checking”, “detecting”, “measuring”, or the like, may refer tooperation(s) and/or process(es) of a processor, a computer, a computingplatform, a computing system, or other electronic device or computingdevice, that may automatically and/or autonomously manipulate and/ortransform data represented as physical (e.g., electronic) quantitieswithin registers and/or accumulators and/or memory units and/or storageunits into other data or that may perform other suitable operations.

Some embodiments of the present invention may perform steps oroperations such as, for example, “determining”, “identifying”,“comparing”, “checking”, “querying”, “searching”, “matching”, and/or“analyzing”, by utilizing, for example: a pre-defined threshold value towhich one or more parameter values may be compared; a comparison between(i) sensed or measured or calculated value(s), and (ii) pre-defined ordynamically-generated threshold value(s) and/or range values and/orupper limit value and/or lower limit value and/or maximum value and/orminimum value; a comparison or matching between sensed or measured orcalculated data, and one or more values as stored in a look-up table ora legend table or a list of reference value(s) or a database ofreference values or ranges; a comparison or matching or searchingprocess which searches for matches and/or identical results and/orsimilar results and/or sufficiently-close results, among multiple valuesor limits that are stored in a database or look-up table; utilization ofone or more equations, formula, weighted formula, and/or othercalculation in order to determine similarity or a match between or amongparameters or values; utilization of comparator units, lookup tables,threshold values, conditions, conditioning logic, Boolean operator(s)and/or other suitable components and/or operations.

The terms “plurality” and “a plurality”, as used herein, include, forexample, “multiple” or “two or more”. For example, “a plurality ofitems” includes two or more items.

References to “one embodiment”, “an embodiment”, “demonstrativeembodiment”, “various embodiments”, “some embodiments”, and/or similarterms, may indicate that the embodiment(s) so described may optionallyinclude a particular feature, structure, or characteristic, but notevery embodiment necessarily includes the particular feature, structure,or characteristic. Repeated use of the phrase “in one embodiment” doesnot necessarily refer to the same embodiment, although it may. Repeateduse of the phrase “in some embodiments” does not necessarily refer tothe same set or group of embodiments, although it may.

As used herein, and unless otherwise specified, the utilization ofordinal adjectives such as “first”, “second”, “third”, “fourth”, and soforth, to describe an item or an object, merely indicates that differentinstances of such like items or objects are being referred to; and doesnot intend to imply as if the items or objects so described must be in aparticular given sequence, either temporally, spatially, in ranking, orin any other ordering manner.

Some embodiments may comprise, or may be implemented by using, an “app”or application which may be downloaded or obtained from an “app store”or “applications store”, for free or for a fee, or which may bepre-installed on a computing device or electronic device, or which maybe transported to and/or installed on such computing device orelectronic device.

Functions, operations, components and/or features described herein withreference to one or more embodiments of the present invention, may becombined with, or may be utilized in combination with, one or more otherfunctions, operations, components and/or features described herein withreference to one or more other embodiments of the present invention. Thepresent invention may comprise any possible combinations,re-arrangements, assembly, re-assembly, or other utilization of some orall of the modules or functions or components that are described herein,even if they are discussed in different locations or different chaptersof the above discussion, or even if they are shown across differentdrawings or multiple drawings.

While certain features of the present invention have been illustratedand described herein, many modifications, substitutions, changes, andequivalents may occur to those skilled in the art. Accordingly, theclaims are intended to cover all such modifications, substitutions,changes, and equivalents.

What is claimed is:
 1. A method comprising: (a) during an ongoing audioconversation, receiving from an electronic device of a participant, atactile input via a touch-screen of the electronic device or via aphysical input unit of the electronic device, of an audio bookmarkingcommand that indicates a request of said participant to create an audiobookmark, wherein step (a) is performed in real time while recordingsaid audio conversation; (b) determining a time-point within said audioconversation, at which said tactile input was received; (c)automatically generating for said participant a user-specific set ofaudio bookmarks for a recorded version of said audio conversation,wherein each audio bookmark in the set of audio bookmarks, points at aparticular time-point within the recorded version of said audioconversation in which said participant provided an audio bookmarkingcommand; wherein the method further comprises: associating between (I)each audio bookmark created by said participant, and (II) a descriptorof a speaker who spoke during a time-slot of said audio bookmark andthat was recognized automatically by a speaker recognition unit;enabling said participant to search or to browse audio bookmarks basedon a speaker descriptor.
 2. The method of claim 1, wherein the methodcomprises: receiving from said participant, at a first time-point T1within said ongoing audio conversation, a first audio bookmarkingcommand; receiving from said participant, at a second time-point T2within said ongoing audio conversation, a second audio bookmarkingcommand; automatically generating a set of audio bookmarks, whichcomprises: (i) a first audio bookmark that points to the firsttime-point T1 in the recorded version of said audio conversation, and(ii) a second audio bookmark that points to the second time-point T2 inthe recorded version of said audio conversation.
 3. The method of claim2, further comprising: generating, for a first participant in said audioconversation, a first set of audio bookmarks that correspond to audiotime-points that said first participant commanded to bookmark;generating separately, for a second participant in said audioconversation, a second and separate set of audio bookmarks thatcorrespond to audio time-points that said second participant commandedto bookmark.
 4. The method of claim 3, wherein each of the first set andthe second set is a private set of audio bookmarks, wherein the methodcomprises: authorizing only the first participant to access the firstset of audio bookmarks; authorizing only the second participant toaccess the second set of audio bookmarks.
 5. The method of claim 1,further comprising: (d) for a set of audio bookmarks having N audiobookmarks of said participant, automatically generating a set of Nseparate audio-clips, wherein each generated audio-clip includes atime-constrained portion of the recorded version of said audioconversation that includes audio that was captured surrounding atime-point of a single audio bookmark, wherein said time-constrainedportion of the recorded audio version includes recorded audio that wasrecorded P1 seconds immediately before the time-point and P2 secondsimmediately after the time-point, wherein P1 and P2 are pre-definedvalues.
 6. The method of claim 1, further comprising: for each audiobookmark created by said participant, applying a Speaker Identificationprocess that identifies a speaker at a time-point that was audiobookmarked; and extracting from the recorded version of theconversation, an audio segment that surrounds said time-point and thatincludes a speech-segment made by said speaker around said time-point.7. The method of claim 1, further comprising: for each audio bookmarkcreated by said participant, applying a Speaker Identification processthat identifies a speaker at a time-point that was audio bookmarked; andextracting from the recorded version of the conversation, an audiosegment that surrounds said time-point and that includes only aspeech-segment made by said speaker around said time-point and thatexcludes speech by any other participants.
 8. The method of claim 1,further comprising: automatically generating a textual transcript ofsaid audio conversation, by utilizing a speech-to-text transcriptionprocess; for each audio bookmark created by said participant, extractingfrom said textual transcript, a transcript portion that corresponds to apre-defined time-slot that surrounds the time-point of the audiobookmark; and providing said transcript portion for copying or sharing.9. The method of claim 1, further comprising: automatically generating atextual transcript of said audio conversation, by utilizing aspeech-to-text transcription process; for each audio bookmark created bysaid participant, extracting from said textual transcript, a transcriptportion that corresponds to a pre-defined time-slot that surrounds thetime-point of the audio bookmark; generating a collection of all thetranscript portions, that correspond to speech segments that were saidduring a time-slot that surrounds the time-point of each audio bookmark.10. The method of claim 1, further comprising: associating between (I)each audio bookmark created by said participant, and (II) one or moreattributes that are set by said participant; wherein said one or moreattributes comprise at least one of: a title, a name, an annotation, atextual descriptor, a speaker name, a topic, a tag; enabling saidparticipant to search or to browse audio bookmarks, based on one or moreattributes.
 11. A method comprising: (a) during an ongoing audioconversation, receiving from an electronic device of a participant, atactile input via a touch-screen of the electronic device or via aphysical input unit of the electronic device, of an audio bookmarkingcommand that indicates a request of said participant to create an audiobookmark, wherein step (a) is performed in real time while recordingsaid audio conversation; (b) determining a time-point within said audioconversation, at which said tactile input was received; (c)automatically generating for said participant a user-specific set ofaudio bookmarks for a recorded version of said audio conversation,wherein each audio bookmark in the set of audio bookmarks, points at aparticular time-point within the recorded version of said audioconversation in which said participant provided an audio bookmarkingcommand; (d) for a set of audio bookmarks having N audio bookmarks ofsaid participant, automatically generating a set of N separateaudio-clips, wherein each generated audio-clip includes atime-constrained portion of the recorded version of said audioconversation that includes audio that was captured surrounding atime-point of a single audio bookmark; (e) based on a textual inputreceived from said participant, assigning a particular name or aparticular tag to a particular audio-clip of said N separateautomatically-generated audio-clips.
 12. The method of claim 11, furthercomprising: performing a search for audio-clips that are associated witha particular name or tag.
 13. A method comprising: (a) during an ongoingaudio conversation, receiving from an electronic device of aparticipant, a tactile input via a touch-screen of the electronic deviceor via a physical input unit of the electronic device, of an audiobookmarking command that indicates a request of said participant tocreate an audio bookmark, wherein step (a) is performed in real timewhile recording said audio conversation; (b) determining a time-pointwithin said audio conversation, at which said tactile input wasreceived; (c) automatically generating for said participant auser-specific set of audio bookmarks for a recorded version of saidaudio conversation, wherein each audio bookmark in the set of audiobookmarks, points at a particular time-point within the recorded versionof said audio conversation in which said participant provided an audiobookmarking command; wherein the method further comprises: automaticallygenerating for said participant an audio trailer of said audioconversation, by aggregating audio-segments of the audio-conversation,each audio-segment corresponding to a time-slot of the audioconversation that surrounds the time-point of the audio bookmark;wherein the audio trailer is provided to said participant as at leastone of: (i) an audio clip for downloading or sharing, (ii) a textualtranscript.
 14. A method comprising: (a) during an ongoing audioconversation, receiving from an electronic device of a participant, atactile input via a touch-screen of the electronic device or via aphysical input unit of the electronic device, of an audio bookmarkingcommand that indicates a request of said participant to create an audiobookmark, wherein step (a) is performed in real time while recordingsaid audio conversation; (b) determining a time-point within said audioconversation, at which said tactile input was received; (c)automatically generating for said participant a user-specific set ofaudio bookmarks for a recorded version of said audio conversation,wherein each audio bookmark in the set of audio bookmarks, points at aparticular time-point within the recorded version of said audioconversation in which said participant provided an audio bookmarkingcommand; wherein the method further comprises: automatically generatingfor said participant, a selective audio trailer of said audioconversation that is based on a particular speaker, by aggregatingaudio-segments of the audio-conversation in which a speaker recognitionunit had recognized said particular speaker, each audio-segmentcorresponding to a time-slot of the audio conversation that surroundsthe time-point of the audio bookmark; wherein the selective audiotrailer is provided to said participant as at least one of: (i) an audioclip for downloading or sharing, (ii) a textual transcript.
 15. A methodcomprising: (a) during an ongoing audio conversation, receiving from anelectronic device of a participant, a tactile input via a touch-screenof the electronic device or via a physical input unit of the electronicdevice, of an audio bookmarking command that indicates a request of saidparticipant to create an audio bookmark, wherein step (a) is performedin real time while recording said audio conversation; (b) determining atime-point within said audio conversation, at which said tactile inputwas received; (c) automatically generating for said participant auser-specific set of audio bookmarks for a recorded version of saidaudio conversation, wherein each audio bookmark in the set of audiobookmarks, points at a particular time-point within the recorded versionof said audio conversation in which said participant provided an audiobookmarking command; wherein the method further comprises: receivingfrom said participant, while said audio conversation is ongoing, anextension command to extend a time-slot that will be associated with arecently-commanded audio bookmark; in response to said extensioncommand, extending a time-length of the time-slot that is associatedwith the recently-commanded audio bookmark.
 16. The A method of claim 1,further comprising: (a) during an ongoing audio conversation, receivingfrom an electronic device of a participant, a tactile input via atouch-screen of the electronic device or via a physical input unit ofthe electronic device, of an audio bookmarking command that indicates arequest of said participant to create an audio bookmark, wherein step(a) is performed in real time while recording said audio conversation;(b) determining a time-point within said audio conversation, at whichsaid tactile input was received; (c) automatically generating for saidparticipant a user-specific set of audio bookmarks for a recordedversion of said audio conversation, wherein each audio bookmark in theset of audio bookmarks, points at a particular time-point within therecorded version of said audio conversation in which said participantprovided an audio bookmarking command; wherein the method furthercomprises: generating from said audio conversation, a user-specificaudio summary which corresponds to audio captured at time-slots thatcorrespond to the audio bookmarks created by said user, and furthergenerating a user-specific textual summary which corresponds to text ofthe audio that was captured in said time-slots; wherein a differentuser-specific audio summary, and a different user-specific textualsummary, are generated and then played or displayed separately to eachuser based on the audio bookmarks that said user created.